(54) Apparatus, method and recording medium for speech synthesis

(57) Various sensors detect conditions outside a robot and an operation applied to the robot, and output the results of detection to a robot-motion-system control section. The robot-motion-system control section determines a behaviour state according to a behaviour model. A robot-thinking-system control section determines an emotion state according to an emotion model. A speech-synthesizing-control-information selection section determines a field on a speech-synthesizing-control-information table according to the behaviour state and the emotion state. A language processing section grammatically analyzes a text for speech synthesizing sent from the robot-thinking-system control section, converts a predetermined portion according to speech-synthesizing control information, and outputs the result to a rule-based speech synthesizing section. The rule-based speech synthesizing section synthesizes a speech signal corresponding to the text for speech synthesizing.
Description

BACKGROUND OF THE INVENTION

1. Field of the Invention

[0001] The present invention relates to a speech synthesizing apparatus and method, a recording medium and a computer program product. Embodiments of the invention relate to a speech synthesizing apparatus, a speech synthesizing method, and a recording medium which are mounted, for example, to a robot to change a speech signal to be synthesized according to the emotion and behaviour of the robot.

2. Description of the Related Art

[0002] There have been robots which utter words. If such robots change their emotions and change the way of speaking according to the emotions, or if they change the way of speaking according to personalities specified for them, such as types, genders, ages, places of birth, characters, and physical characteristics, they imitate living things more realistically.

[0003] The user will treat such robots with friendship and love as if they were pets. The problem is that such robots have not yet been implemented.
SUMMARY OF THE INVENTION

[0004] Embodiments of the present invention seek to provide a robot which changes the way of speaking according to the emotion and behaviour to imitate living things more realistically.

[0005] One aspect of the present invention provides a speech synthesizing apparatus for synthesizing a speech signal corresponding to a text, including behaviour-state changing means for changing a behaviour state according to a behaviour model; emotion-state changing means for changing an emotion state according to an emotion model; selecting means for selecting control information according to at least one of the behaviour state and the emotion state; and synthesizing means for synthesizing a speech signal corresponding to the text according to speech synthesizing information included in the control information selected by the selecting means.

[0006] A speech synthesizing apparatus of an embodiment of the present invention may be configured such that it further includes detecting means for detecting an external condition and the selecting means selects the control information also according to the result of detection achieved by the detecting means.

[0007] A speech synthesizing apparatus of an embodiment of the present invention may be configured such that it further includes holding means for holding individual information and the selecting means selects the control information also according to the individual information held by the holding means.

[0008] A speech synthesizing apparatus of an embodiment of the present invention may be configured such that it further includes counting means for counting the elapsed time from activation and the selecting means selects the control information also according to the elapsed time counted by the counting means.

[0009] A speech synthesizing apparatus of an embodiment of the present invention may be configured such that it further includes accumulating means for accumulating at least one of the number of times the behaviour-state changing means changes behaviour states and the number of times the emotion-state changing means changes emotion states and the selecting means selects the control information also according to the number of times accumulated by the accumulating means.

[0010] A speech synthesizing apparatus of an embodiment of the present invention may further include substituting means for substituting for words included in the text by using a word substitute dictionary corresponding to selection information included in the control information selected by the selecting means.

[0011] A speech synthesizing apparatus of an embodiment of the present invention may further include converting means for converting the style of the text according to a style conversion rule corresponding to selection information included in the control information selected by the selecting means.

[0012] Another aspect of the present invention provides a speech synthesizing method for a speech synthesizing apparatus for synthesizing a speech signal corresponding to a text, including a behaviour-state changing step of changing a behaviour state according to a behaviour model; an emotion-state changing step of changing an emotion state according to an emotion model; a selecting step of selecting control information according to at least one of the behaviour state and the emotion state; and a synthesizing step of synthesizing a speech signal corresponding to the text according to speech synthesizing information included in the control information selected by the process of the selecting step.

[0013] Still another aspect of the present invention provides a recording medium storing a computer-readable speech-synthesizing program for synthesizing a speech signal corresponding to a text, the program including a behaviour-state changing step of changing a behaviour state according to a behaviour model; an emotion-state changing step of changing an emotion state according to an emotion model; a selecting step of selecting control information according to at least one of the behaviour state and the emotion state; and a synthesizing step of synthesizing a speech signal corresponding to the text according to speech synthesizing information included in the control information selected by the process of the selecting step.

[0014] In a speech synthesizing apparatus, a speech synthesizing method, and a program stored in a recording medium according to the present invention, a behaviour state is changed according to a behaviour model and an emotion state is changed according to an emotion model. Control information is selected according to at least one of the behaviour state and the emotion state. A speech signal is synthesized corresponding to a text according to speech synthesizing information included in the selected control information.

[0015] The present invention also provides a computer program product having instructions for carrying out the 
invention when run on a suitable data processing system. 

BRIEF DESCRIPTION OF THE DRAWINGS

[0016] For a better understanding of the present invention, reference will now be made by way of example to the accompanying drawings, in which:

Fig. 1 is a block diagram showing an example structure of a portion related to speech synthesizing of a robot to which an embodiment of the present invention is applied.
Fig. 2 is a block diagram showing an example structure of a robot-motion-system control section 10 and a robot-thinking-system control section 11 shown in Fig. 1.
Fig. 3 is a view showing a behaviour model 32 shown in Fig. 2.
Fig. 4 is a view showing an emotion model 42 shown in Fig. 2.
Fig. 5 is a view showing speech-synthesizing control information.
Fig. 6 is a block diagram showing a detailed example structure of a language processing section 14.
Fig. 7 is a flowchart showing the operation of the robot to which an embodiment of the present invention is applied.
Fig. 8 is a block diagram showing another example structure of the portion related to speech synthesizing of the robot to which an embodiment of the present invention is applied.
Fig. 9 is a block diagram showing still another example structure of the portion related to speech synthesizing of the robot to which an embodiment of the present invention is applied.
Fig. 10 is a block diagram showing yet another example structure of the portion related to speech synthesizing of the robot to which an embodiment of the present invention is applied.

DESCRIPTION OF THE PREFERRED EMBODIMENTS

[0017] Fig. 1 shows an example structure of a portion, related to speech synthesizing, in a robot to which an embodiment of the present invention is applied. This robot has a word-utterance function, changes the emotion and behaviour, and changes the way of speaking according to changes in the emotion and behaviour.

[0018] Various sensors 1 detect conditions outside the robot and an operation applied to the robot, and output the results of detection to a robot-motion-system control section 10. For example, an outside-temperature sensor 2 detects the outside temperature of the robot. A temperature sensor 3 and a contact sensor 4 are provided nearby as a pair. The contact sensor 4 detects the contact of the robot with an object, and the temperature sensor 3 detects the temperature of the contacted object. A pressure-sensitive sensor 5 detects the strength of an external force (such as force applied by hitting or that applied by patting) applied to the robot. A wind-speed sensor 6 detects the speed of wind blowing outside the robot. An illuminance sensor 7 detects illuminance outside the robot. An image sensor 8 is formed, for example, of a CCD, and detects a scene outside the robot as an image signal. A sound sensor 9 is formed, for example, of a microphone and detects sound.

[0019] The robot-motion-system control section 10 is formed of a motion-system processing section 31 and a behavior model 32, as shown in Fig. 2, and manages the operation of the robot. The motion-system processing section 31 compares the results of detection input from the various sensors 1, an internally generated event, and an instruction input from a robot-thinking-system control section 11 with the behavior model 32 to change the behavior of the robot, and outputs the current behavior to a speech-synthesizing-control-information selection section 12 as a behavior state. The motion-system processing section 31 also determines a behavior event according to the results of detection input from the various sensors 1, and outputs it to the robot-thinking-system control section 11. When the result of detection achieved by the pressure-sensitive sensor 5 shows a force equal to or more than a predetermined threshold, for example, the motion-system processing section 31 determines that the behavior event is "being hit on the head." Furthermore, the motion-system processing section 31 relays the results of detection sent from the various sensors 1 to the robot-thinking-system control section 11. The various sensors 1 may directly input the results of detection to a thinking-system processing section 41.
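
The threshold test described in this paragraph can be sketched as follows. This is a minimal illustration only: the function name `detect_behavior_event`, the constant `HIT_THRESHOLD`, and its value are assumptions, since the text specifies only "a predetermined threshold."

```python
from typing import Optional

# Hypothetical value; the text only says "a predetermined threshold".
HIT_THRESHOLD = 5.0


def detect_behavior_event(pressure: float,
                          threshold: float = HIT_THRESHOLD) -> Optional[str]:
    """Map a pressure-sensor reading to a behavior event name.

    Returns None when the force is below the threshold, i.e. no event.
    """
    if pressure >= threshold:  # "equal to or more than" the threshold
        return "being hit on the head"
    return None
```

A reading exactly at the threshold counts as a hit, matching the "equal to or more than" wording.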

[0020] The behavior model 32 describes a condition used when the robot changes from a standard state to each of various behaviors, as shown in Fig. 3. When the instruction "walk" is issued at the standard state, for example, a transition to the behavior "walking" occurs. When the instruction "get up" is issued, a transition to the behavior "getting up" occurs. When the internal event "operation finished" is generated if the specified behavior is finished, a transition to the standard state occurs.
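
One way to picture such a behavior model is as a small transition table. The sketch below encodes only the transitions named in the text; the table layout, the names `BEHAVIOR_MODEL` and `next_behavior`, and the choice to ignore unknown triggers are assumptions made for illustration.

```python
STANDARD = "standard"

# (current state, instruction or internal event) -> next state.
# Only the transitions mentioned in the text are listed.
BEHAVIOR_MODEL = {
    (STANDARD, "walk"): "walking",
    (STANDARD, "get up"): "getting up",
    ("walking", "operation finished"): STANDARD,
    ("getting up", "operation finished"): STANDARD,
}


def next_behavior(state: str, trigger: str) -> str:
    """Return the next behavior state; unknown triggers leave the state unchanged."""
    return BEHAVIOR_MODEL.get((state, trigger), state)
```

The emotion model 42 could be represented the same way, with behavior events as triggers and emotions as states.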

[0021] Back to Fig. 1, the robot-thinking-system control section 11 is formed of the thinking-system processing section 41 and an emotion model 42, as shown in Fig. 2, and manages the emotion of the robot. The thinking-system processing section 41 compares a behavior event input from the motion-system processing section 31, the results of detection achieved by the various sensors 1, and an internally generated event (such as events periodically generated at a fixed interval) with the emotion model 42 to change the emotion of the robot, and outputs the current emotion to the speech-synthesizing-control-information selection section 12 as an emotion state. The thinking-system processing section 41 also outputs an instruction related to a behavior to the motion-system processing section 31 in response to the results of detection achieved by the various sensors 1. Furthermore, the thinking-system processing section 41 generates a text for speech synthesizing to be uttered by the robot in response to a behavior event and the results of detection achieved by the various sensors 1, and outputs it to a language processing section 14. When the behavior event of "being hit on the head" occurs, for example, the thinking-system processing section 41 generates the text, "ouch," for speech synthesizing.

[0022] The emotion model 42 describes a condition used when the robot changes from a standard state to each of various emotions, as shown in Fig. 4. When the behavior event "being hit on the head" occurs at the standard state, for example, a transition to the emotion "angry" occurs. When the behavior event [...] occurs, a transition to the emotion "happy" occurs. When an internal event is generated if a behavior event does not occur for a predetermined time period or more, a transition to [...] occurs.

[0023] Back to Fig. 1, the speech-synthesizing-control-information selection section 12 selects a field having the most appropriate speech-synthesizing control information among a number of fields prepared in a speech-synthesizing-control-information table 13, according to a behavior state input from the robot-motion-system control section 10 and an emotion state input from the robot-thinking-system control section 11. In this selection, a field may also be selected according to an additional parameter besides the behavior state and the emotion state (details will be described later by referring to Fig. 8 to Fig. 10).

[0024] The speech-synthesizing-control-information table 13 has a number of fields corresponding to all combinations of behavior states, emotion states, and other parameters (described later). The speech-synthesizing-control-information table 13 outputs the selection information stored in the field selected by the speech-synthesizing-control-information selection section 12 to the language processing section 14, and outputs speech-synthesizing control information to a rule-based speech synthesizing section 15.

[0025] Each field includes selection information and speech-synthesizing control information, as shown in Fig. 5. The selection information is formed of a word-mapping-dictionary ID and a style-conversion-rule ID. The speech-synthesizing control information is formed of a segment-data ID, a syllable-set ID, a pitch parameter, a parameter of the intensity of accent, a parameter of the intensity of phrasify, and an utterance-speed parameter.
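
The field layout and the table lookup might be modelled as below. This is a sketch under stated assumptions: the class and attribute names, the numeric values, and the fallback to a default field are all hypothetical; the text only lists which items a field contains and says the field is chosen by behavior state and emotion state.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class SelectionInfo:
    word_mapping_dictionary_id: str
    style_conversion_rule_id: str


@dataclass(frozen=True)
class SynthesisControlInfo:
    segment_data_id: str
    syllable_set_id: str
    pitch: float
    accent_intensity: float
    phrasify_intensity: float
    utterance_speed: float


@dataclass(frozen=True)
class Field:
    selection: SelectionInfo          # sent to the language processing section
    control: SynthesisControlInfo     # sent to the rule-based synthesizer


# Hypothetical table contents; real values would come from table 13.
DEFAULT_FIELD = Field(
    SelectionInfo("standard", "none"),
    SynthesisControlInfo("child voice", "basic", 1.0, 1.0, 1.0, 1.0),
)
TABLE = {
    ("getting up", "angry"): Field(
        SelectionInfo("standard", "none"),
        SynthesisControlInfo("child voice", "basic", 1.2, 1.5, 1.0, 1.3),
    ),
}


def select_field(behavior_state: str, emotion_state: str) -> Field:
    """Look up the field for a (behavior state, emotion state) pair."""
    return TABLE.get((behavior_state, emotion_state), DEFAULT_FIELD)
```

Additional parameters (personality, elapsed time, accumulated counts) would simply extend the lookup key.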

[0026] Word-mapping-dictionary IDs are prepared in advance in a word-mapping-dictionary database 54 (Fig. 6). Each of them is information to specify a dictionary to be used in a word conversion section 53 (Fig. 6) among a plurality of dictionaries, such as a word mapping dictionary for baby talk, a word mapping dictionary for the Osaka dialect, a word mapping dictionary for words used by girls in senior high schools, and a word mapping dictionary for words used for imitating cats. Word mapping dictionaries are switched according to personality information, described later, of the robot, and are used for replacing words included in a text for speech synthesizing expressed in the standard language with other words. For example, the word mapping dictionary for baby talk substitutes the word "buubu" for the word "kuruma" included in a text for speech synthesizing.
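
Word substitution with a selected dictionary can be sketched as a per-word lookup. The dictionary ID `"baby_talk"`, the function name `convert_words`, and the romanized spellings "kuruma" and "buubu" are assumptions for illustration; the input is taken to be a list of words already separated by morphological analysis.

```python
# Hypothetical database: dictionary ID -> {standard word: replacement}.
WORD_MAPPING_DICTIONARIES = {
    "baby_talk": {"kuruma": "buubu"},
}


def convert_words(words, dictionary_id):
    """Replace each word that has an entry in the selected dictionary."""
    mapping = WORD_MAPPING_DICTIONARIES.get(dictionary_id, {})
    return [mapping.get(word, word) for word in words]
```

Words without an entry pass through unchanged, so an unknown dictionary ID leaves the text as it was.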

[0027] Style-conversion-rule IDs are prepared in advance in a style-conversion-rule database 56 (Fig. 6). Each of them is information to specify a rule to be used in a style conversion section 55 (Fig. 6) among a plurality of rules, such as a rule of conversion to female words, a rule of conversion to male words, a rule of conversion to baby talk, a rule of conversion to the Osaka dialect, a rule of conversion to words used by girls in senior high schools, and a rule of conversion to words used for imitating cats. Style conversion rules are switched according to the personality information, described later, of the robot, and are used for replacing letter strings included in a text for speech synthesizing with other letter strings. For example, the style rule of conversion to words used for imitating cats substitutes the word "nya" for the word "desu" used at the end of a sentence.
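
A style conversion rule of this kind amounts to a string rewrite. The sketch below expresses the cat-imitation example as a regular expression; the rule ID `"cat"`, the function name `convert_style`, and the assumption that the rule applies only to a sentence-final "desu" (optionally followed by one punctuation mark) are illustrative choices, not details given in the text.

```python
import re

# Hypothetical database: rule ID -> list of (pattern, replacement) pairs.
STYLE_CONVERSION_RULES = {
    # "desu" only when it ends the text, optionally before ., ! or ?
    "cat": [(re.compile(r"desu(?=[.!?]?$)"), "nya")],
}


def convert_style(text: str, rule_id: str) -> str:
    """Apply every rewrite of the selected rule to the text, in order."""
    for pattern, replacement in STYLE_CONVERSION_RULES.get(rule_id, []):
        text = pattern.sub(replacement, text)
    return text
```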

[0028] The segment-data ID included in the speech-synthesizing control information is information used for specifying a speech segment to be used in the rule-based speech synthesizing section 15. Speech segments are prepared in advance in the rule-based speech synthesizing section 15 for female voice, male voice, child voice, hoarse voice, mechanical voice, and other voices.

[0029] The syllable-set ID is information to specify a syllable set to be used by the rule-based speech synthesizing section 15. For example, 266 basic syllable sets and 180 simplified syllable sets are prepared. The 180 simplified syllable sets have a more restricted number of phonemes which can be uttered than the 266 basic syllable sets. With the 180 simplified syllable sets, for example, "ringo" included in a text for speech synthesizing, input into the language processing section 14, is pronounced as "ningo." When phonemes which can be uttered are restricted in this way, voice utterance of lisping infants can be expressed.
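
The effect of a restricted syllable set can be illustrated by mapping syllables that are unavailable in the simplified set to available ones. The substitution table, the set IDs `"basic"` and `"simplified"`, and the function name `pronounce` are assumptions; only the "ringo" to "ningo" example comes from the text.

```python
# Hypothetical substitutions for syllables missing from the simplified set.
SIMPLIFIED_SUBSTITUTIONS = {"ri": "ni"}


def pronounce(syllables, syllable_set_id):
    """Join syllables, replacing those the simplified set cannot utter."""
    if syllable_set_id != "simplified":
        return "".join(syllables)
    return "".join(SIMPLIFIED_SUBSTITUTIONS.get(s, s) for s in syllables)
```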

[0030] The pitch parameter is information used to specify the pitch frequency of a speech to be synthesized by the rule-based speech synthesizing section 15. The parameter of the intensity of accent is information used to specify the intensity of an accent of a speech to be synthesized by the rule-based speech synthesizing section 15. When this parameter is large, utterance is achieved with strong accents. When the parameter is small, utterance is achieved with weak accents.

[0031] The parameter of the intensity of phrasify is information used for specifying the intensity of phrasify of a speech to be synthesized by the rule-based speech synthesizing section 15. When this parameter is large, frequent phrasifies occur. When the parameter is small, few phrasifies occur. The utterance-speed parameter is information used to specify the utterance speed of a speech to be synthesized by the rule-based speech synthesizing section 15.

[0032] Back to Fig. 1, the language processing section 14 analyzes a text for speech synthesizing input from the robot-thinking-system control section 11 in terms of grammar, converts a predetermined portion according to the speech-synthesizing control information, and outputs to the rule-based speech synthesizing section 15.

[0033] Fig. 6 shows an example structure of the language processing section 14. The text for speech synthesizing sent from the robot-thinking-system control section 11 is input to a style analyzing section 51. The selection information sent from the speech-synthesizing-control-information table 13 is input to the word conversion section 53 and to the style conversion section 55. The style analyzing section 51 uses an analyzing dictionary 52 to apply morphological analysis to the text for speech synthesizing and outputs to the word conversion section 53. The analyzing dictionary 52 describes information required for rule-based speech synthesizing, such as reading of words (morphological elements), accent types, and parts of speech.

[0034] The word conversion section 53 reads the dictionary corresponding to the word-mapping-dictionary ID included in the selection information, from the word-mapping-dictionary database 54; substitutes words specified in the read word mapping dictionary among the words included in the text for speech synthesizing to which morphological analysis has been applied, sent from the style analyzing section 51; and outputs to the style conversion section 55.

[0035] The style conversion section 55 reads the rule corresponding to the style-conversion-rule ID included in the selection information, from the style-conversion-rule database 56; converts the text for speech synthesizing to which the word conversion has been applied, sent from the word conversion section 53, according to the read style conversion rule; and outputs to the rule-based speech synthesizing section 15.
[0036] Back to Fig. 1, the rule-based speech synthesizing section 15 synthesizes a speech signal corresponding to the text for speech synthesizing input from the language processing section 14, according to the speech-synthesizing control information input from the speech-synthesizing-control-information table 13. The synthesized speech signal is changed to sound by a speaker 16.

[0037] A control section 17 controls a drive 18 to read a control program stored in a magnetic disk 19, an optical disk 20, a magneto-optical disk 21, or a semiconductor memory 22, and controls each section according to the read control program.

[0038] The processing of the robot to which an embodiment of the present invention is applied will be described below by referring to a flowchart shown in Fig. 7. This processing starts, for example, when the pressure-sensitive sensor 5, one of the various sensors 1, detects a condition in which the user hit the head of the robot, and the result of detection is input to the motion-system processing section 31 of the robot-motion-system control section 10.
[0039] In step S1, the motion-system processing section 31 determines that a behaviour event "being hit on the head" occurs, when the result of detection achieved by the pressure-sensitive sensor 5 shows that a force equal to or more than a predetermined threshold has been applied, and reports the determination to the thinking-system processing section 41 of the robot-thinking-system control section 11. The motion-system processing section 31 also compares the behaviour event, "being hit on the head," with the behaviour model 32 to determine a robot behaviour "getting up," and outputs it as a behaviour state to the speech-synthesizing-control-information selection section 12.

[0040] In step S2, the thinking-system processing section 41 of the robot-thinking-system control section 11 compares the behavior event, "being hit on the head," input from the motion-system processing section 31, with the emotion model 42 to change the emotion to "angry," and outputs the current emotion as an emotion state to the speech-synthesizing-control-information selection section 12. The thinking-system processing section 41 also generates the text, "ouch," for speech synthesizing in response to the behavior event, "being hit on the head," and outputs it to the style analyzing section 51 of the language processing section 14.
[0041] In step S3, the speech-synthesizing-control-information selection section 12 selects a field having the most appropriate speech-synthesizing control information among a number of fields prepared in the speech-synthesizing-control-information table 13, according to the behavior state input from the motion-system processing section 31 and the emotion state input from the thinking-system processing section 41. The speech-synthesizing-control-information table 13 outputs the selection information stored in the selected field to the language processing section 14, and outputs the speech-synthesizing control information to the rule-based speech synthesizing section 15.

[0042] In step S4, the style analyzing section 51 of the language processing section 14 uses the analyzing dictionary 52 to apply morphological analysis to the text for speech synthesizing, and outputs to the word conversion section 53. In step S5, the word conversion section 53 reads the dictionary corresponding to the word-mapping-dictionary ID included in the selection information, from the word-mapping-dictionary database 54; substitutes words specified in the read word mapping dictionary among the words included in the text for speech synthesizing to which morphological analysis has been applied, sent from the style analyzing section 51; and outputs to the style conversion section 55. In step S6, the style conversion section 55 reads the rule corresponding to the style-conversion-rule ID included in the selection information from the style-conversion-rule database 56; converts the text for speech synthesizing to which word conversion has been applied, sent from the word conversion section 53; and outputs to the rule-based speech synthesizing section 15.

[0043] In step S7, the rule-based speech synthesizing section 15 synthesizes a speech signal corresponding to the text for speech synthesizing input from the language processing section 14, according to the speech-synthesizing control information input from the speech-synthesizing-control-information table 13, and changes it to a sound at the speaker 16.
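
Steps S1 to S7 can be traced end to end in a compact sketch. Everything concrete here is an assumption made for illustration: the threshold value, the one-entry table, whitespace splitting as a stand-in for morphological analysis, and returning a dictionary describing the synthesis request instead of producing audio.

```python
HIT_THRESHOLD = 5.0  # hypothetical "predetermined threshold"

# Hypothetical one-entry table: (behaviour state, emotion state) -> field.
TABLE = {
    ("getting up", "angry"): {
        "selection": {"word_dictionary": {}, "style_rule": lambda s: s},
        "control": {"pitch": 1.2, "utterance_speed": 1.3},
    },
}


def process_pressure_event(pressure):
    # S1: determine the behaviour event and the behaviour state.
    if pressure < HIT_THRESHOLD:
        return None
    event = "being hit on the head"
    behaviour_state = "getting up"
    # S2: change the emotion and generate the text to utter.
    emotion_state = "angry"
    text = "ouch."
    # S3: select the field for this state pair.
    field = TABLE[(behaviour_state, emotion_state)]
    # S4: stand-in for morphological analysis (split on whitespace).
    words = text.split()
    # S5: word conversion with the selected dictionary.
    mapping = field["selection"]["word_dictionary"]
    words = [mapping.get(w, w) for w in words]
    # S6: style conversion with the selected rule.
    converted = field["selection"]["style_rule"](" ".join(words))
    # S7: stand-in for synthesis; return the request instead of audio.
    return {"event": event, "text": converted, **field["control"]}
```

Because the selected field carries both the selection information and the control parameters, the same text can come out with different wording and different prosody depending on the state pair.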

[0044] With the above-described processing, the robot behaves as if it had emotions. The robot changes the way of speaking according to its behavior and the change of its emotion.

[0045] A method for adding a parameter other than the behavior state and the emotion state in the selection process of the speech-synthesizing-control-information selection section 12 will be described next by referring to Fig. 8 to Fig. 10.

[0046] Fig. 8 shows an example structure in which a communication port 61, a communication control section 62, and a personality information memory 63 are added to the example structure shown in Fig. 1 to give the robot its personality. The communication port 61 is an interface for transmitting and receiving personality information to and from an external apparatus (such as a personal computer), and can be, for example, one of those conforming to communication standards such as RS-232C, USB, and IEEE 1394. The communication control section 62 controls information communication with an external unit through the communication port 61 according to a predetermined protocol, and outputs received personality information to the robot-thinking-system control section 11. The personality information memory 63 is a rewritable, non-volatile memory such as a flash memory, and outputs stored personality information to the speech-synthesizing-control-information selection section 12.
[0047] The following example items can be considered as personality information sent [...]

Type:                Dog/cat
Gender:              Male/female
Age:                 Child/adult
Temper:              Violent/gentle
Physical condition:  Lean/overweight

[0048] Each of these items is stored in the personality information memory 63 as binary data, 0 or 1. Each item may be specified not by binary data but by multi-valued data.
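
With five binary items, the personality information fits in a few bits. The sketch below packs the items into one integer and unpacks them again; the bit order, the item names, and which alternative is encoded as 0 or 1 are assumptions, since the text says only that each item is stored as 0 or 1.

```python
# Assumed bit order, lowest bit first.
ITEMS = ("type", "gender", "age", "temper", "physical_condition")


def pack_personality(personality):
    """Pack a dict of item name -> 0/1 into a single integer."""
    value = 0
    for bit, name in enumerate(ITEMS):
        value |= (personality[name] & 1) << bit
    return value


def unpack_personality(value):
    """Recover the dict of item name -> 0/1 from the packed integer."""
    return {name: (value >> bit) & 1 for bit, name in enumerate(ITEMS)}
```

Multi-valued items would need more than one bit per item, or a separate stored value for each.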

[0049] To prevent personality information from being rewritten very frequently, the number of times it is rewritten may be restricted. A password may be specified for rewriting. A personality information memory 63 formed of a ROM in which personality information has been written in advance may be built in at manufacturing without providing the communication port 61 and the communication control section 62.

[0050] With such a structure, a robot which outputs a voice different from that of another robot, according to the specified personality, is implemented.

[0051] Fig. 9 shows an example structure in which a timer 71 is added to the example structure shown in Fig. 1. The timer 71 counts the elapsed time from when the robot is first activated, and outputs the time to the speech-synthesizing-control-information selection section 12. The timer 71 may count the time in which the robot is being operated, from when the robot is first driven.

[0052] With such a structure, a robot which changes an output voice according to the elapsed time is implemented.
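
How the elapsed time feeds the selection is not specified; one plausible reading is that the time is bucketed into stages and the stage becomes an extra key for the table lookup. The stage names, the hour boundaries, and the function name `stage_for` below are all hypothetical.

```python
import bisect

# Hypothetical stage boundaries, in hours since first activation.
STAGE_STARTS_HOURS = [0, 100, 1000]
STAGES = ["child", "adolescent", "adult"]


def stage_for(elapsed_hours: float) -> str:
    """Return the stage whose starting time most recently passed."""
    if elapsed_hours < 0:
        raise ValueError("elapsed time cannot be negative")
    index = bisect.bisect_right(STAGE_STARTS_HOURS, elapsed_hours) - 1
    return STAGES[index]
```

The returned stage could then select, for example, a different segment-data ID so that the voice appears to mature over time.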

[0053] Fig. 10 shows an example structure in which an empirical-value calculation section 81 and an empirical-value memory 82 are added to the example structure shown in Fig. 1. The empirical-value calculation section 81 counts the number of times emotional transitions occur for each changed emotion state when the thinking-system processing
[...] be made to have a pleasant way of speaking.

[...] various functions [...] Various programs are [...]

[0059] The recording medium is formed of a package medium which is distributed to the user for providing the program separately from the computer, such as a magnetic disk 19 (including a floppy disk), an optical disk 20 (including a CD-ROM (compact disc read-only memory) [...]), a magneto-optical disk 21 (including [...]), or a semiconductor memory 22, as shown in Fig. 1.

[0060] In the present specification, steps [...] include not only processes which are executed in a time-sequential manner according to a described order but also processes which are not necessarily achieved in a time-sequential manner [...]

[0061] As described above, [...] speech synthesizing apparatus, speech synthesizing method and a [...] a text [...] the behaviour to imitate a living thing more real.

[...] In so far as the embodiments of the invention described [...]
3. A speech synthesizing apparatus according to Claim 1, further comprising detecting means for detecting an external condition,

wherein the selecting means selects the control information also according to the result of detection achieved by the detecting means.

4. A speech synthesizing apparatus according to Claim 1, further comprising holding means for holding individual information, and

wherein the selecting means selects the control information also according to the individual information held by the holding means.

5. A speech synthesizing apparatus according to Claim 1, further comprising counting means for counting the elapsed time from activation, and

wherein the selecting means selects the control information also according to the elapsed time counted by the counting means.

6. A speech synthesizing apparatus according to Claim 1, further comprising accumulating means for accumulating at least one of the number of times the behavior-state changing means changes behavior states and the number of times the emotion-state changing means changes emotion states, and

wherein the selecting means selects the control information also according to the number of times accumulated by the accumulating means.

7. A speech synthesizing apparatus according to Claim 1, further comprising substituting means for substituting for words included in the text by using a word substitute dictionary corresponding to selection information included in the control information selected by the selecting means.

8. A speech synthesizing apparatus according to Claim 1, further comprising converting means for converting the style of the text according to a style conversion rule corresponding to selection information included in the control information selected by the selecting means.

9. A speech synthesizing apparatus according to Claim 1, wherein the speech synthesizing apparatus is a robot.

10. A speech synthesizing method for a speech synthesizing apparatus for synthesizing a speech signal corresponding to a text, comprising:

a behaviour-state changing step of changing a behaviour state according to a behaviour model;

an emotion-state changing step of changing an emotion state according to an emotion model;

a selecting step of selecting control information according to at least one of the behaviour state and the emotion state; and

a synthesizing step of synthesizing a speech signal corresponding to the text according to speech synthesizing information included in the control information selected by the process of the selecting step.

11. A recording medium storing a computer-readable speech-synthesizing program for synthesizing a speech signal corresponding to a text, the program comprising:

a behaviour-state changing step of changing a behaviour state according to a behaviour model;

an emotion-state changing step of changing an emotion state according to an emotion model;

a selecting step of selecting control information according to at least one of the behaviour state and the emotion state; and

a synthesizing step of synthesizing a speech signal corresponding to the text according to speech synthesizing information included in the control information selected by the process of the selecting step.
FIG. 6